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DESCRIPTION 

PICTURE CODING METHOD, PICTURE DECODING METHOD, PICTURE 
CODING APPARATUS, PICTURE DECODING APPARATUS, AND 
PROGRAM THEREOF 

5 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional 
Application No. 60/538,065, filed January 20, 2004, and the benefit 
of U.S. Provisional Application No. 60/561,351, filed April 12, 2004 
10 the contents of which are herein incorporated by reference in its 
entirety. 

Technical Field 

The present invention relates to a picture coding method for 
15 efficiently compressing moving pictures and to a picture decoding 
method for correctly decoding such compressed moving pictures, as 
well as to a picture coding apparatus, a picture decoding apparatus, 
and a program thereof. 

20 Background Art 

In the age of multimedia which integrally handles audio, video 
and other pixel values, existing information media, i.e., newspaper, 
magazine, television, radio, telephone and other means through 
which information is conveyed to people, have recently come to be 

25 included in the scope of multimedia. Generally, multimedia refers 
to something that is represented by associating not only characters, 
but also graphics, audio, and especially pictures and the like 
together. However, in order to include the aforementioned existing 
information media into the scope of multimedia, it appears as a 

30 prerequisite to represent such information in digital form. 

However, when calculating the amount of information 
contained in each of the aforementioned information media as the 
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amount of digital information, while the amount of information per 
character is 1 to 2 bytes in the case of characters, the amount of 
information to be required is 64Kbits per second in the case of audio 
(telephone quality), and lOOMbits per second in the case of moving 
5 pictures (current television reception quality). Therefore, it is not 
realistic for the aforementioned information media to handle such an 
enormous amount of information as it is in digital form. For 
example, although video phones are already in the actual use by 
using Integrated Services Digital Network (ISDN) which offers a 

10 transmission speed of 64Kbits/s to 1.5Mbits/s, it is not practical to 
transmit video of televisions and cameras directly through ISDN. 

Against this backdrop, information compression techniques 
have become required, and moving picture compression techniques 
compliant with H.261 and H.263 standards recommended by ITU-T 

15 (International Telecommunication Union-Telecommunication 
Standardization Sector) are employed for video phones, for example. 
Moreover, according to information compression techniques 
compliant with the MPEG-1 standard, it is possible to store picture 
information into an ordinary music CD (compact disc) together with 

20 sound information. 

Here, MPEG (Moving Picture Experts Group) is an 
international standard on compression of moving picture signals 
standardized by ISO/IEC (International Organization for 
Standardization, International Electrotechnical Commission), and 

25 MPEG-1 is a standard for compressing television signal information 
approximately into one hundredth so that moving picture signals 
can be transmitted at a rate of 1.5 Mbit/s. Furthermore, since a 
transmission speed achieved by the MPEG-1 standard is a 
middle-quality speed of about 1.5Mbit/s, MPEG-2, which was 

30 standardized with a view to satisfying requirements for further 
improved picture quality, allows data transmission equivalent in 
quality to television broadcasting through which moving picture 



signals are transmitted at a rate of 2 to 15Mbit/s. Moreover, 
MPEG-4 was standardized by the working group (ISO/IEC 
JTC1/SC29/WG11) which promoted the standardization of MPEG-1 
and MPEG-2. MPEG-4, which provides a higher compression ratio 
5 than that of MPEG-1 and MPEG-2 and which enables an object-based 
coding/decoding/operation, is capable of providing a new 
functionality required in this age of multimedia. At the beginning 
stage of standardization, MPEG-4 aimed at providing a low bit rate 
coding method, but it has been extended as a standard supporting 

10 more general coding that handles interlaced images as well as high 
bit rate coding. Currently, an effort has been made jointly by 
ISO/IEC and ITU-T for standardizing MPEG-4 AVC and ITU-T H.264 
as picture coding methods of the next generation that offer a higher 
compression ratio. As of August 2002, a committee draft (CD) is 

15 issued for a picture coding method of the next generation. 

In general, in coding of a moving picture, the amount of 
information is compressed by reducing redundancies in temporal 
and spatial directions. Therefore, in inter picture prediction coding 
aiming at reducing temporal redundancies, motion estimation and 

20 the generation of a predicative image are carried out on a 
block-by-block basis with reference to forward or backward 
picture(s), and coding is then performed on the difference value 
between the obtained predictive image and an image in the current 
picture to be coded. Here, "picture" is a term denoting one image. 

25 In the case of a progressive image, "Picture" means a frame, 
whereas it means a frame or fields in the case of an interlaced image. 
Here, "interlaced image" is an image of a frame composed of two 
fields which are separated in capture time. In coding and decoding 
of an interlaced image, it is possible to handle one frame as a frame 

30 as it is, as two fields, or as a frame structure or a field structure on 
a per-block basis within the frame. 

A picture to be coded using intra picture prediction without 
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reference to any pictures shall be referred to as an I picture. A 
picture to be coded using inter picture prediction with reference to 
only one picture shall be referred to as a P picture. And, a picture 
to be coded using inter picture prediction with reference to two 
pictures at the same time shall be referred to as a B picture. It is 
possible for a B picture to refer to two pictures which can be 
arbitrarily combined from forward/backward pictures in display 
order. Reference images (reference pictures) can be determined 
for each block serving as a basic coding/decoding unit. Distinction 
shall be made between such reference pictures by calling a 
reference picture to be described earlier in a coded bitstream as a 
first reference picture, and by calling a re_ference picture to be 
described later in the bitstream as a second reference picture. 
Note that as a condition for coding and decoding these types of 
pictures, pictures used for reference are required to be already 
coded and decoded. 

P pictures and B pictures are coded using motion 
compensated inter picture prediction. Coding by use of motion 
compensated inter picture prediction is a coding method that 
employs motion compensation in inter picture prediction coding. 
Unlike a method for performing prediction simply based on pixel 
values in a reference picture, motion estimation is a technique 
capable of improving prediction accuracy as well as reducing the 
amount of data by estimating the amount of motion (hereinafter 
referred to as "motion vector") of each part within a picture and 
further by performing prediction in consideration of such amount of 
motion. For example, it is possible to reduce the amount of data 
through motion compensation by estimating motion vectors of the 
current picture to be coded and then by coding prediction residuals 
between prediction values obtained by shifting only the amount of 
the respective motion vectors and the current picture to be coded. 
In this technique, motion vectors are also recorded or transmitted in 



coded form, since motion vector information is required at the time 
of decoding. 

Motion vectors are estimated on a per-macroblock basis. 
More specifically, a macroblock shall be previously fixed in the 
current picture to be coded, so as to estimate motion vectors by 
finding the position of the most similar reference block of such fixed 
macroblock within the search area in a reference picture. 

FIG. 1 is a diagram illustrating an example data structure of a 
bitstream. As FIG. 1 shows, the bitstream has a hierarchical 
structure such as below. The bitstream (Stream) is formed of more 
than one group of pictures (GOP). By using GOPs as basic coding 
units, it becomes possible to edit a moving picture as well as to 
make a random access. Each GOP is made up of plural pictures, 
each of which is one of I picture, P picture, and B picture. Each 
picture is further made up of plural slices. Each slice, which is a 
strip-shaped area within each picture, is made up of plural 
macroblocks. Moreover, each stream, GOP, picture, and slice 
includes a synchronization signal (sync) for indicating the ending 
point of each unit and a header (header) which is data common to 
said each unit. 

Note that when data is carried not in a bitstream being a 
sequence of streams, but in a packet and the like being a piecemeal 
unit, the header and the data portion, which is the other part than 
the header, may be carried separately. In such case, the header 
and the data portion shall not be incorporated into the same 
bitstream, as shown in FIG. 1. In the case of a packet, however, 
even when the header and the data portion are not transmitted 
contiguously, it is simply that the header corresponding to the data 
portion is carried in another packet. Therefore, even when the 
header and the data portion are not incorporated into the same 
bitstream, the concept of a coded bitstream described with 
reference to FIG. 1 is also applicable to packets. 



Generally speaking, the human sense of vision is more 
sensitive to the low frequency components than to the high 
frequency components. Furthermore, since the energy of the low 
frequency components in a picture signal is larger than that of the 
5 high frequency components, picture coding is performed in order 
from the low frequency components to the high frequency 
components. As a result, the number of bits required for coding the 
low frequency components is larger than that required for the high 
frequency components. 

10 In view of the above points, the existing coding methods use 

larger quantization steps for the high frequency components than 
for the low frequency components when quantizing transformation 
coefficients, which are obtained by orthogonal transformation, of 
the respective frequencies. This technique has made it possible for 

15 the conventional coding methods to achieve a large increase in 
compression ratio with a small loss of picture quality from the 
standpoint of viewers. 

Meanwhile, since quantization step sizes of the high 
frequency components with regard to the low frequency components 

20 depend on picture signal, a technique for changing the sizes of 
quantization steps for the respective frequency components on a 
picture-by-picture basis has been conventionally employed. A 
quantization matrix is used to derive quantization steps of the 
respective frequency components. FIG. 2 shows an example 

25 quantization matrix. In this drawing, the upper left component is a 
DC component, whereas rightward components are horizontal high 
frequency components and downward components are vertical high 
frequency components. The quantization matrix in FIG. 2 also 
indicates that a larger quantization step is applied to a larger value. 

30 Usually, it is possible to use different quantization matrices for each 
picture. The value indicating the size of a quantization step of each 
frequency component is fixed length-coded. Note that it is usual 
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that each component of a quantization matrix and the value of each 
quantization step are approximately proportional to each other, but 
it is not necessary to stick to such relationship as long as the 
correspondence between them is clearly defined. 
5 However, such conventional methods have a problem that 

coding efficiency is lowered by simply performing fixed length 
coding due to the fact that the values of the respective frequency 
components of a quantization matrix center within a certain range. 

10 DDscDosoire of Invemiltooini 

It is an object of the present invention to provide a picture 
coding method, a picture decoding method, a picture coding 
apparatus, a picture decoding apparatus, and a program that are 
capable of improving the efficiency of quantization matrix coding. 

15 In order to achieve the above object, the picture coding 

method according to the present invention is a picture coding 
method for coding a picture on a block-by-block basis through 
orthogonal transformation and quantization, and coding a 
quantization matrix that is used to derive quantization steps for 

20 frequencies of orthogonal transformation coefficients, the method 
comprising: calculating a difference value between each of 
frequency components included in the quantization matrix and a 
predetermined value corresponding to said each of the frequency 
components; and coding the difference value into a variable length 

25 code, wherein a code length of the variable length code is shorter as 
the difference value is smaller, or equal to a code length of a 
neighboring difference value of said difference value. Here, the 
predetermined value may be the value of the most-frequently 
occurring high frequency component or a constant indicating an 

30 average value of the frequency components or a value previously 
determined for each frequency component. 

With the above construction, since a difference value between 
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each frequency component and a predetermined value is calculated, 
the resulting difference values are smaller than the frequency 
components, and since not the frequency components themselves 
but the difference values are coded, it becomes possible to shorten 
5 the code length of each variable length code and therefore to 
improve coding efficiency. 

Here, the predetermined value may be a value of a frequency 
component corresponding to an immediately previous difference 
value. 

10 With the above construction, since there is usually a 

correlation between a frequency component and the previous 
frequency component, it is possible for each difference value to take 
a further smaller value and therefore to further improve coding 
efficiency. 

15 Here, difference values may be calculated in order from low 

frequencies to high frequencies of the frequency components 
included in the quantization matrix. 

With the above construction, since frequency components are 
coded from the lower frequencies to the higher frequencies, it is 

20 highly probable that each frequency component and its previous 
frequency component take similar values. Therefore, it becomes 
possible for each difference value to take a further smaller value in 
a more reliable manner and therefore to further improve coding 
efficiency. 

25 Here, each of the difference values may be represented as a 

remainder of 2 raised to a power of k (k is a constant). 

With the above construction, in the case where each 
difference value is represented as a remainder that results from 
dividing each difference value by 2 raised to a power of k (k = 8), it 

30 becomes possible for them to take small values that can be 
represented virtually by 8 bits and therefore to further reduce the 
number of variable length code bits. 



-8- 



Here, the above picture coding method may further comprise: 
judging whether or not there exist consecutive difference values 0 
corresponding to a latter part of the quantization matrix; and coding 
difference values up to an immediately previous difference value of 
5 a top difference value 0 into variable length codes, when the 
judgment is that there exist consecutive difference values 0, without 
coding the consecutive difference values 0 into variable length 
codes. 

Here, the above picture coding method may further comprise 
10 adding an end code after the variable length code of the immediately 

previous difference value of the top difference value 0. 

With the above constructions, in the case where there are 

consecutive frequency components with the same value at the last 

of a quantization matrix, only one variable length code is generated 
15 for such plurality of frequency components, rather than generating a 

plurality of variable length codes with the same bit string. 

Accordingly, it becomes possible to further improve coding efficiency. 

In this case, by adding an end code, it becomes also possible to 

improve the efficiency of decoding performed by a picture decoding 
20 apparatus. 

Furthermore, in order to achieve the above object, the picture 
decoding method according to the present invention is a picture 
decoding method for decoding a coded picture on a block-by-block 
basis through inverse quantization and inverse orthogonal 

25 transformation, and decoding a variable-length coded quantization 
matrix, the method comprising: variable-length decoding the 
quantization matrix into a difference value corresponding to each of 
frequency components included in said quantization matrix; and 
calculating each of the frequency components of the quantization 

30 matrix by adding the difference value to a predetermined value 
corresponding to said each of the frequency components, wherein a 
code length of a variable length code of the difference value is 
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shorter as the difference value is smaller, or equal to a code length 

of a neighboring difference value of said difference value. 

Here, the predetermined value may be a value of a frequency 

component that is calculated by an immediately previous addition. 
5 Here, additions may be performed in order from low 

frequencies to high frequencies of the frequency components 

included in the quantization matrix. 

Here, each of the frequency components may be represented 

as a remainder of 2 raised to a power of k (k is a constant). 
10 With the above constructions, it becomes easy to decode 

variable length bit strings with which it is possible to improve coding 

efficiency, through the above coding method. 

Here, in the case where an end code is detected from the 

variable-length coded quantization matrix, a value that is the same 
15 as a value of an immediately previous frequency component of the 

end code may be outputted as a value of each of the subsequent 

frequency components. 

With the above construction, in the case where there are 

consecutive frequency components with the same value at the last 
20 of a quantization matrix, it becomes possible to decode only one 

variable length code to obtain all of such plurality of frequency 

components. Furthermore, an end code makes it possible to easily 

judge that there are consecutive frequency components with the 

same value at the last of a quantization matrix. 
25 Note that the picture coding apparatus, picture decoding 

apparatus, and program according to the present invention also 

have the same constructions as well as the same effects as above. 

BrSeff Description off Drawings 
30 These and other objects, advantages and features of the 

invention will become apparent from the following description 
thereof taken in conjunction with the accompanying drawings that 
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illustrate a specific embodiment of the invention. In the Drawings: 
FIG. 1 is a diagram illustrating an example data structure of a 
bitstream; 

FIG. 2 is a diagram showing an example quantization matrix; 
5 FIG. 3 is a block diagram showing a construction of a picture 

coding apparatus; 

FIG. 4 is a block diagram showing a construction of a WM 
coding unit; 

FIGs. 5A to 5D are diagrams, each showing an order of coding 
10 a quantization matrix; 

FIG. 6A is a diagram showing a concrete example of 
component values of a quantization matrix; 

FIG. 6B is a diagram showing a concrete example of 
difference values in a process of coding a quantization matrix; 
15 FIG. 6C is a diagram showing a concrete example of codes 

through processes of calculating difference values from frequency 
components and coding variable length codes; 

FIG. 7 is a flowchart showing coding processing performed by 
the WM coding unit; 
20 FIG. 8A is a diagram showing a concrete example of variable 

length coding in the case where difference values are positive; 

FIG. 8B is a diagram showing a concrete example of variable 
length coding in the case where difference values are signed values; 
FIG. 8C is a diagram showing a concrete example of variable 
25 length coding; 

FIG. 9A is a diagram showing an arrangement of frequency 
components of a quantization matrix; 

FIGs. 9B and 9C are diagrams showing how each coded data 
of the frequency components of the quantization matrix is arranged 
30 in a header; 

FIG. 10 is a block diagram showing a construction of a picture 
decoding apparatus; 
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FIG. 11 is a block diagram showing a construction of a WM 
decoding unit; 

FIG. 12 is a flowchart showing processing performed by the 
WM decoding unit; 

FIGs. 13A to 13C are diagrams illustrating a recording 
medium that stores a program for realizing, by a computer system, 
the picture coding method and the picture decoding method 
according to the above embodiment; 

FIG. 14 is a block diagram showing an overall configuration of 
a content supply system; 

FIG. 15 is a diagram showing an external view of a cellular 
phone that employs the picture coding method and the picture 
decoding method; 

FIG. 16 is a block diagram showing a construction of the 
cellular phone; and 

FIG. 17 is a diagram showing an overall configuration of a 
digital broadcasting system. 

Best Mode ffoir CarryDimg Out the Iimveinitiioni 
20 (First Embodiment) 

Referring to FIGs. 3 to 17, the first embodiment of the present 
invention is described. 

FIG. 3 is a block diagram showing the construction of a picture 
coding apparatus according to the first embodiment of the present 
25 invention. 

A picture coding apparatus 1 is an apparatus for performing 
compression coding on an input picture signal Vin and outputting a 
coded picture signal Str which has been coded into a bitstream by 
performing variable length coding and the like. Such picture coding 
30 apparatus 1 is comprised of a motion estimation unit ME, a motion 
compensation unit MC, a subtraction unit Sub, an orthogonal 
transformation unit T, a quantization unit Q, an inverse quantization 
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unit IQ, an inverse orthogonal transformation unit IT, an addition 
unit Add, a picture memory PicMem, a switch SW, and a variable 
length coding unit VLC. 

The picture signal Vin is inputted to the subtraction unit Sub 
5 and the motion estimation unit ME. The subtraction unit Sub 
calculates a residual image between each image in the input picture 
signal Vin and each predictive image on a block-by-block basis, and 
outputs the calculated residual image to the orthogonal 
transformation unit T. The orthogonal transformation unit T 

10 performs orthogonal transformation on the residual image to 
transform it into frequency coefficients, and outputs them to the 
quantization unit Q. 

The quantization unit Q quantizes the frequency coefficients 
of each block that are inputted from the subtraction unit Sub, using 

15 quantization steps that are derived by reference to a quantization 
matrix WM, and outputs the resulting quantized values Qcoef to the 
variable length coding unit VLC. 

The inverse quantization unit IQ performs inverse 
quantization on the quantized values Qcoef using quantization steps 

20 that are derived by reference to the quantization matrix WM, so as 
to turn them into the frequency coefficients, and outputs them to 
the inverse orthogonal transformation unit IT. The inverse 
orthogonal transformation unit IT performs inverse frequency 
transformation on the frequency coefficients so as to transform 

25 them into a residual image, and outputs it to the addition unit Add. 
The addition unit Add adds each residual image and each predictive 
image outputted from the motion estimation unit MC, so as to form 
a decoded image. The switch SW turns ON when it is indicated that 
such decoded image should be stored, and such decoded image is to 

30 be stored into the picture memory PicMem. 

Meanwhile, the motion estimation unit ME, which receives the 
picture signal Vin on a macroblock basis, detects an image area 
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closest to such input image signal Vin from among decoded pictures 
stored in the picture memory PicMem, and determines motion 
vector(s) MV indicating the position of such area. Motion vectors 
are estimated for each block, which is obtained by further dividing a 
5 macrobiock. When this is done, it is possible to use more than one 
picture as reference pictures. Here, since a plurality of pictures can 
be used as reference pictures, identification numbers (reference 
index Index) to identify the respective reference pictures are 
required on a block-by-block basis. The association between a 
10 reference index and the picture number of each picture stored in the 
picture memory PicMem makes it possible for a reference picture to 
be designated. 

The motion compensation unit MC reads out an optimum 
picture as a predictive picture from among the decoded pictures 
15 stored in the picture memory PicMem, based on the motion vectors 
detected in the above processing and on the reference indices 
Index. 

i 
i 

The variable length coding unit VLC performs variable length 
coding on each of the quantization matrices WM, quantized values 

20 Qcoef, reference indices Index, and motion vectors MV so as to 
output them as a coded stream Str. In order to do this, the variable 
length coding unit VLC includes a first coding unit and a second 
coding unit. The first coding unit (hereinafter referred to as "WM 
coding unit") performs variable length coding on each quantization 

25 matrix WM, whereas the second coding unit performs variable length 
coding on data other than the quantization matrices WM, i.e., each 
of the quantized values Qcoef, reference indices Index, and motion 
vectors MV. The WM coding unit calculates a difference value 
between each frequency component in each quantization matrix and 

30 a predetermined value corresponding to such frequency component, 
and codes such calculated difference into a variable length code. 
The smaller the different value, the shorter the length of the 
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resulting variable length code becomes, excluding some exceptions. 

FIG. 4 is a block diagram showing the construction of the WM 
coding unit. As this drawing shows, the WM coding unit is 
comprised of a subtraction unit 41, a W buffer 42, an offset 
5 determination unit 43, an addition unit 44, a block buffer 45, a 
number determination unit 46, an end code holding unit 47, a switch 
48, and a variable length coding unit 49. 

The subtraction unit 41 calculates a difference value between 
each frequency component in the quantization matrix and a 

10 predetermined value corresponding to such frequency component. 
Here, the predetermined value may be the value of the 
most-frequently occurring high-frequency component or a constant 
indicating an average value of the frequency components or a value 
previously determined for each frequency component. In the 

15 present embodiment, the frequency component value corresponding 
to the previously determined difference value is used as a 
predetermined value. In this case, the subtraction unit 41 
calculates a difference value between the current frequency 
component W[i] inputted from outside and the previous frequency 

20 component W[i-1] held in the W buffer 42. Since there is usually a 
correlation between the frequency component W[i] and the previous 
frequency component W[i-1], latter difference values become 
smaller. Note that "W[i]" indicates that this is the i-th frequency 
component in coding order to be coded. 

25 FIGs. 5A to 5D are diagrams, each showing an example order 

of the frequency components of a quantization matrix to be inputted 
to the subtraction unit 41. These orders indicate a scanning order 
in which a quantization matrix to be inputted to the subtraction unit 
41 is scanned. Orthogonal transformation in picture coding is most 

30 frequently performed for each 4x4 pixels or for each 8x8 pixels. 
In view of this, FIGs. 5A and 5B illustrate an example of 4x4 pixels 
and FIGs. 5C and 5D illustrate an example of 8x8 pixels. Since 
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each frequency component in a quantization matrix is coded on an 
individual basis according to one of the patterns shown in FIGs. 8A, 
8B, and 8C, there is no difference in coding efficiency regardless of 
whether coding is performed from lower frequency components to 
5 higher frequency components as shown in FIGs. 5A and 5C or simply 
in horizontal order as shown in FIGs. 5B and 5D. 

FIG. 6A is a diagram showing some of the frequency 
components of a quantization matrix inputted to the subtraction unit 
41. W[0], W[l] 7 W[2], W[3], ... shown in this drawing are in coding 

10 order shown in FIG. 5B. 

The W buffer 42 is a buffer for holding the previous frequency 
component W[i-1]. Before coding starts, the W buffer 42 holds, as 
an initial value W[-l], a value that is most typically used as a DC 
component. In the present embodiment, 8 is used as the initial 

15 value W[-l]. Note that the W buffer 42 may hold a predetermined 
value K that corresponds to each frequency component, instead of 
holding the previous frequency component W[i-1], 

The offset determination unit 43 determines an offset value 
that is used to convert the difference value (W[i]-W[i-1]) inputted 

20 from the subtraction unit 41 into a remainder by dividing such 
difference by 2 raised to a power of k, where k = 8 in this case. More 
specifically, the offset determination unit 43 outputs one of -256, 0, 
and 256 so that a result of adding an offset value to (W[i]-W[i-1]) 
falls within the range of values between -128 and 127. 

25 The addition unit 44 calculates a difference value D[i] by 

adding the difference value (W[i]-W[i-1]) from the subtraction unit 
41 to the offset value from the offset determination unit 43. Note 
that "D[i]" indicates that this is the i-th difference value in coding 
order to be coded. 

30 FIG. 6B is a diagram showing an example of difference values 

D[i] calculated by the addition unit 44. In this drawing, the 
difference values D[0], D[l], D[2], D[3], ... correspond to the 
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frequency components W[0], W[l], W[2], W[3], ... shown in FIG. 6A, 
respectively. 

The block buffer 45 is a buffer for holding difference values 
D[i] inputted from the addition unit 44 equivalent to one block 
corresponding to one quantization matrix. 

The number determination unit 46 counts the number of 
difference values 0 (W[i]-W[i-1] = 0) that exist consecutively at the 
last of scanning order (at the last of the block), and controls the 
switch 48 so that D[0] to D[M] held in the block buffer 45 are 
outputted. Here, D[M] is the previous difference value of the top 
difference value 0 of the above consecutive difference values 0. 

The end code holding unit 47 holds an end code that indicates 
the end of the variable length codes of a quantization matrix. Any 
value that cannot be difference values D[i] may serve as an end 
code. 

The switch 48 selects "block buffer output" so that D[0] to 
D[M] held in the block buffer 45 are outputted, and then selects "end 
code holding unit output" so that one end code is outputted from the 
end code holding unit 47. In other words, the switch 48 outputs the 
following: the difference values starting with the difference value 
D[0] corresponding to the top frequency component of a 
quantization matrix through to the difference value D[M] at the 
previous difference value of the consecutive difference values 0 that 
exist at the last of the quantization matrix; and one end code. Note 
that the switch 48 does not output an end code in the case where the 
number of consecutive difference values 0 (W[i]-W[i-1] = 0) counted 
by the number determination unit 46 is zero, the consecutive 
difference values existing at the end of the scanning order (at the 
last of the block). 

The variable length coding unit 49 performs variable length 
coding on the difference values inputted from the block buffer 45 via 
the switch 48 and on the end code so that the length of the resulting 
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codes becomes shorter, or equal to a neighboring difference value as 
the difference values are smaller. 

FIG. 6C is a diagram showing a concrete example of codes 
resulting from the coding performed by the variable length coding 
5 unit 49 using codes shown in FIG. 8C. In this drawing, W[0] to 
W[4], and D[0] to D[4] correspond to FIGs. 6A and 6B. The 
variable length coding unit 49 performs coding so that the code 
length becomes 1 bit when the difference value is 0 and the code 
length becomes 3 bits when the difference value is 1, for example. 

10 As this example implies, variable length coding performed by the 
variable length coding unit 49 results in shorter code length as the 
difference value is smaller. Furthermore, all of the frequency 
components from W[30] to W[63] take the same value of 64, in 
which case their difference values D[30] to D[63] become all 0. In 

15 this case, no coding is performed on these difference values D[30] 
to D[63], and variable length coding is performed on the end code 
instead of on the code "1" for 0 of the difference value D[30]. 

FIG. 7 is a flowchart showing detailed coding processing 
performed by the WM coding unit for coding a quantization matrix. 

20 In this flowchart, "Num" denotes the number of frequency 
components (e.g., 16, 64) in the quantization matrix, whereas "i" 
and "j" denote variables used to count from 0 to (Num-1). 

First, the WM coding unit causes the W buffer 42 to hold the 
initial value W[-l] (e.g., 8) (S71), calculates a difference value of 

25 each frequency component of the quantization matrix through 
processes shown as a loop 1 (S72 to S77), and stores the resulting 
difference value into the block buffer 45. Then, the WM coding unit 
performs variable length coding on each difference value through 
processes shown as a loop 2 (S78 to S81). 

30 In the loop 1, the subtraction unit 41 calculates a difference 

value D between the current frequency component W[i] inputted 
from outside and the previous frequency component W[i-1] held in 



- 18 - 



the W buffer 42 (S73). The offset determination unit 43 
determines an offset value used to convert the difference D inputted 
from the subtraction unit 41 into a remainder by dividing such 
difference D by 2 raised to a power of k, where k = 8 in this case 
5 (S74). For example, in the case where W[i] takes a value in the 
range between -256 and -1-254, the offset determination unit 43 
makes the following determination: the offset value should be -256 
when D is 128 or greater; the offset value should be 256 when D is 
smaller than -128; and the offset value should be 0 when D takes 

10 another value in the above range. As a result, a value that is 
determined by adding an offset value to D (i.e., D[i]) becomes a 
remainder within the range between -128 and +127, where not only 
positive values but also negative values are included. Since a value 
taken by a difference value Dp] becomes smaller through this 

15 process, the length of a variable length code of D[i] obtained by 
performing coding using a code shown in FIG. 8C becomes shorter 
accordingly. 

The addition unit 44 calculates a difference value D[i] by 
adding the difference value D ( = W[i]-W[i-1]) from the subtraction 
20 unit 41 to an offset value from the offset determination unit 43 (S75 
and S76). 

In the loop 2, the number determination unit 46 determines 
the number of consecutive difference values 0 that exist at the last 
of the D[i] column (S79), and connects the switch 48 to "block buffer 
25 output" for the difference values up to the difference value D[M] that 
is the previous difference value of such consecutive difference 
values 0, and then connects the switch 48 to "end code holding unit 
output". 

The variable length coding unit 49 codes the difference values 
30 D[i] inputted from the block buffer 45 via the switch 48 and the end 
code inputted from the end code holding unit 47 via the switch 48 
(S82). Here, since the end code can take any value that cannot be 
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taken by D[i], the end code here is -W[M], i.e., W[M] = 0. This is 
because the value satisfying "W[M]=0" can be identified as the end 
code since the frequency components of the quantization matrix are 
positive. 

5 FIGs. 8A to 8C are diagrams, each showing an example of 

variable length coding performed by the variable length coding unit 
49. Exponential Golomb codes are used in these examples. 

FIG. 8A shows a first example of variable length codes. This 
drawing shows a relationship between code words (code) and 

10 difference values before being coded (value). This example is 
applicable to the case where difference values can take positive 
values. The larger the difference values, the less frequently they 
occur and the longer their code length becomes, whereas the 
smaller the difference values, the shorter the length of their code 

15 words becomes. Conversion from the value of each component into 
a code word can be easily carried out using an arithmetic expression 
shown in FIG. 8A. When the difference value is 7, for example, it is 
represented as 7 = 4 + 3 (2 raised to a power of 2 + binary "1 1"). The 
code word for this difference value 7 is represented as a bit string 

20 consisting of "000 (three 0s)", "1", and "11". "N" in FIG. 8A denotes 
the closest exponential of 2 that does not exceed each difference 
value. "X 0 XiX 2 ... X N -i" denotes a bit string indicating the value 
resulting from subtracting the closest exponential of 2 from each 
difference value. 

25 FIG. 8B shows a second example of variable length coding. 

In this drawing, the value column describes the frequency 
components W[i] of the quantization matrix (i.e., predetermined 
value K + difference value). This example is applicable to the case 
where the value of each frequency component W[i] of a quantization 

30 matrix is most likely to take the predetermined value K. Since the 
occurrence frequency of the predetermined value K is high, the code 
length becomes shortest when the value of a frequency component 
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is K, with the code length becoming longer as the value of a 
frequency component is distant from K. The predetermined value K 
may be W[i-1] 7 for example, or another previously determined 
constant. 

5 FIG. 8C shows a third example of variable length codes. This 

drawing shows a relationship between code words (code) and 
difference values before being coded (value). This example is 
applicable to the case where difference values can take not only 
positive values but also negative values. When each component of 

10 a quantization matrix is coded in order shown in one of FIGs. 5A to 
5D, the values of neighboring frequency components W[i-1] are 
strongly correlated. Thus, by coding a difference value between 
the previously coded component value W[i-1] and the current 
component value W[i] in order shown in one of FIGs. 5A to 5D, the 

15 resulting difference values center around 0. Thus, it becomes 
possible to further reduce the number of variable length code bits by 
assigning code words with shorter code length to values around 0 
and assigning code words with longer code length to values distant | 
from 0, as FIG. 8C shows. 

20 FIGs. 9A to 9C are diagrams showing a quantization matrix 

and the data structure of the quantization matrix coded in a stream. 
In FIGs. 9B and 9C, "Header" is equivalent to the header of a 
stream/GOP/picture shown in FIG. 1. FIG. 9A is a diagram showing 
an arrangement of the frequency components of a quantization 

25 matrix. In this drawing, "Wi,j" denotes a component of the 
quantization matrix in the j-th row in the i-th line. FIGs. 9B and 9C 
illustrate how each data obtained by coding each component of the 
quantization matrix is arranged in the header. "WeightingMatrix" 
represents a bitstream obtained by coding the quantization matrix. 

30 FIG. 9B shows a stream obtained by performing coding in order 
shown in FIG. 5C, whereas FIG. 9C shows a stream obtained by 
performing coding in order shown in FIG. 5D. Note that "Wi, j" in 
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the streams shown in FIGs. 9B and 9C indicates that this is a coded 
variable length code corresponding to Wi, j shown in FIG. 9A. 

As described above, since the WM coding unit according to the 
present embodiment determines a difference value D[i] between a 
5 frequency component W[i] and a predetermined value K, the 
difference value D[i] takes a value smaller than the frequency 
component W[i]. Furthermore, since the WM coding unit does not 
code a frequency component W[i] itself but each difference value 
D[i], it becomes possible to shorten the code length of each variable 

10 length code and therefore to improve coding efficiency. 

Moreover, the use of the previous frequency component 
W[i-1] as a predetermined value K makes it possible for each 
difference value D[i] to take a further smaller value, as a result of 
which coding efficiency is further improved. This is because there 

15 is usually a correlation between each frequency component W[i] and 
the previous frequency component W[i-1]. 

What is more, since frequency components W[i] are coded in 
order from the lower frequencies to the higher frequencies, it is 
highly probable that each frequency component W[i] and the 

20 previous frequency component W[i-1] take similar values. 
Therefore, it becomes possible for each difference value D[i] to take 
a further smaller value in a more reliable manner, as a result of 
which coding efficiency is further improved. ■-■ 

Furthermore, in the case where each difference value D[i] is 

25 represented as a remainder that results from dividing each 
difference value by 2 raised to a power of k (k = 8), it becomes 
possible for them to take small values that can be represented 
virtually by 8 bits and therefore to further reduce the number of 
variable length code bits. 

30 Moreover, in the case where there are concecutive frequency 

components with the same value at the last of a quantization matrix, 
no variable length code is generated for such plurality of frequency 
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components, rather than generating a plurality of variable length 
codes with the same bit string. Accordingly, it becomes possible to 
further improve coding efficiency. In this case, by adding an end 
code, it becomes also possible to improve the efficiency of decoding 
performed by a picture decoding apparatus. 

FIG. 10 is a block diagram showing the construction of a 
picture decoding apparatus according to the first embodiment of the 
present invention. In this drawing, units that operate in the same 
manner as that of the units in the picture coding apparatus shown in 
FIG. 3 are assigned the same numbers, and descriptions thereof are 
omitted. 

The variable length decoding unit VLD decodes the coded 
stream Str into the quantization matrices WM, quantized values 
Qcoef, reference indices Index, and motion vectors MV. In order to 
do this, the variable length decoding unit VLD includes a first 
decoding unit and a second decoding unit. The first decoding unit 
(hereinafter referred to as "WM decoding unit") performs variable 
length decoding on each coded quantization matrix WM, whereas the 
second decoding unit performs variable length decoding on 
information other than the coded quantization matrices WM, i.e., 
each of the coded quantized values Qcoef, reference indices Index, 
and motion vectors MV. The WM decoding unit determines the 
frequency components of each quantization matrix by 
variable- length decoding the variable- length -coded quantization 
matrix into difference values of the respective frequency 
components and then by adding each resulting difference value to a 
predetermined value corresponding to its frequency component. 

Decoding is performed on each of the quantization matrices 
QM, quantized values Qcoef, reference indices Index, and motion 
vectors MV through the picture memory PicMem, the motion 
compensation unit MC, and the inverse quantization unit IQ. This 
decoding is already described above with reference to the block 
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diagram of the picture coding apparatus shown in FIG. 3. 

FIG. 11 is a block diagram showing the construction of the WM 
decoding unit. As this drawing shows, the WM decoding unit is 
comprised of a variable length decoding unit 51, an addition unit 52, 
a remainder calculation unit 53, a switch 54, a W buffer 55, a switch 
56, and an end coefficient judgment unit 57. 

The variable length decoding unit 51 decodes the variable 
length code of each of the variable-length coded frequency 
components into a difference value D[i]. 

The addition unit 52 adds each difference value D[i] from the 
variable length decoding unit 51 to the value of the previous 
frequency component W[i-1] from the W buffer 55. 

The remainder calculation unit 53 determines each frequency 
component W[i] by adding each addition result of the addition unit 
52 to 2 raised to a power of k, and then by dividing the resulting 
value by 2 raised to a power of k to calculate a remainder. More 
specifically, the remainder calculation unit 53 carries out 
(D[i] + W[i-l] + 256)%256. Here, n %" denotes an operation for 
determining a remainder obtained by dividing the resulting value by 
256. 

The switch 54 turns on while no end code is outputted from 
the remainder calculation unit 53 and turns off after an end code is 
outputted. 

The W buffer 55 holds a frequency component W[i] inputted 
from the remainder calculation unit 53 via the switch 54 and outputs 
it to the addition unit 52 in the next cycle as a frequency component 
W[i-1]. The W buffer 55 further holds W[i-1] (e.g., 8) as an initial 
value. 

The switch 56 keeps selecting a frequency component W[i] 
inputted from the remainder calculation unit 53 via the switch 54 
while no end code is outputted from the remainder calculation unit 
53, whereas selecting the W buffer 55 after an end code is 
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outputted. 

The end coefficient judgment unit 57 judges whether an end 
code has been outputted from the remainder calculation unit 53 and 
controls the switch 54 and the switch 56 based on such judgment. 
5 As described above, it is possible to decode W[i] 

corresponding to all the frequency components. Note that even 
when no end code is detected, decoding of the quantization matrix is 
terminated at the completion of the decoding of difference values 
D[i] that are equivalent in number to the number of all the frequency 

10 components included in the block. 

FIG. 12 is a flowchart showing processing performed by the 
WM decoding unit. In this flowchart, "Num" denotes the number of 
frequency components (e.g., 16, 64) in the quantization matrix, 
whereas "i" denotes a variable used to count from 0 to (Num-1) and 

15 "j" denotes a variable used to count from "i" to (Num-1). As FIG. 
12 shows, the WM decoding unit first causes the W buffer 55 to hold 
the initial value W[M] (S121), performs variable length decoding 
and outputs the resulting codes through processes shown as a loop 
1 (S122 to S127), and outputs concecutive frequency coefficients 

20 with the same value through processes shown as a loop 2 (S128 to 
S130) after an end code is detected. 

In the loop 1, the variable length decoding unit 51 decodes an 
inputted variable length code into a difference value D[i] (S123). 
Then, the addition unit 52 adds the difference value from the 

25 variable length decoding unit 51 to the previous frequency 
component from the W buffer 55, and the remainder calculation unit 
53 carries out a calculation so that a remainder W satisfies the 
following (SI 24): W = (D[i] + W[i-l] + 256)%256. The end 
coefficient judgment unit 57 judges whether this remainder W is an 

30 end code or not (here, whether the remainder W is positive or 
negative) (S125). When the end coefficient judgment unit 57 
judges that the remainder W is not an end code, such remainder W 
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is outputted as a frequency component W[i-1] from the remainder 
calculation unit 53 via the switch, and is stored into the W buffer 55 
via the switch 54 at the same time (S126). Meanwhile, when the 
end coefficient judgment unit 57 judges that the remainder W is an 
end code, the processes of the loop 2 are then carried out. 

In the loop 2, when there are concecutive frequency 
components with the same value at the last of the quantization 
matrix, frequency components W[i-1] are consecutively outputted 
as frequency components W[i]. More specifically, frequency 
components W[i-1] held in the W buffer 55 are outputted via the 
switch 56 as the subsequent frequency components W[i] to 
W[Num-l] (S129). 

As described above, the WM decoding unit according to the 
present embodiment correctly decodes variable length codes that 
have been differential coded by the WM coding unit. Furthermore, 
when there are concecutive frequency components with the same 
value at the last of a quantization matrix, the WM decoding unit is 
required to decode only one variable length code to obtain all of such 
plurality of frequency components. In this case, an end code 
makes it possible to easily judge that there are consecutive 
frequency components with the same value at the last of a 
quantization matrix. 

Furthermore, if a program for realizing the picture coding 
method and the picture decoding method as shown in the 
aforementioned embodiment are recorded on a recording medium 
such as a flexible disk, it becomes possible to easily perform the 
processing presented in the above embodiment in an independent 
computer system. 

FIGs. 13A, 13B, and 13C are diagrams illustrating a recording 
medium that stores a program for realizing the picture coding 
method and the picture decoding method according to the 
aforementioned embodiment. 
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FIG. 13B shows an external view of a flexible disk viewed from 
the front, its schematic cross-sectional view, and the flexible disk 
itself, while FIG. 13A illustrates an example physical format of the 
flexible disk as a recording medium itself. The flexible disk FD is 
5 contained in a case F, and a plurality of tracks Tr are formed 
concentrically on the surface of the flexible disk FD in the radius 
direction from the periphery, each track being divided into 16 
sectors Se in the angular direction. Therefore, in the flexible disk 
storing the above-mentioned program, the picture coding method 

10 and the picture decoding method as such program is recorded in an 
area allocated for it on the flexible disk FD. 

Meanwhile, FIG. 13C shows the structure required for 
recording and reading out the program on and from the flexible disk 
FD. When the program realizing the above picture coding method 

15 and picture decoding method is to be recorded onto the flexible disk 
FD, such program shall be written by the use of the computer system 
Cs via a flexible disk drive FDD. Meanwhile, when the picture 
coding method and the picture decoding method are to be 
constructed in the computer system Cs through the program on the 

20 flexible disk FD, the program shall be read out from the flexible disk 
FD via the flexible disk drive FDD and then transferred to the 
computer system Cs. 

The above description is given on the assumption that a 
recording medium is a flexible disk, but an optical disc may also be 

25 used. In addition, the recording medium is not limited to this, and 
any other medium such as an IC card and a ROM cassette capable of 
recording a program can also be used. 

The following describes application examples of the picture 
coding method and picture decoding method as shown in the above 

30 embodiment as well as a system using them. 

FIG. 14 is a block diagram showing an overall configuration of 
a content supply system exlOO that realizes a content distribution 
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service. The area for providing a communication service is divided 
into cells of desired size, and base stations exl07~exl 10, which are 
fixed wireless stations, are placed in the respective cells. 

In this content supply system exlOO, devices such as a 
5 computer exlll, a PDA (Personal Digital Assistant) ex 112, a camera 
exll3, a cellular phone exll4, and a camera-equipped cellular 
phone exllS are respectively connected to the Internet exlOl via 
an Internet service provider exl02, a telephone network exl04, and 
the base stations exl07~exll0. 

10 However, the content supply system exlOO is not limited to 

the combination as shown in FIG. 14, and may be connected to a 
combination of any of them. Also, each of the devices may be 
connected directly to the telephone network exl04, not via the base 
stations exl07~exll0, which are fixed wireless stations. 

15 The camera exll3 is a device such as a digital video camera 

capable of shooting moving pictures. The cellular phone may be a 
cellular phone of a PDC (Personal Digital Communication) system, a 
CDMA (Code Division Multiple Access) system, a W-CDMA 
(Wideband-Code Division Multiple Access) system or a GSM (Global 

20 System for Mobile Communications) system, a PHS (Personal 
Handyphone system) or the like, and may be any one of these. 

Furthermore, a streaming server exl03 is connected to the 
camera exll3 via the base station exl09 and the telephone network 
exl04, which enables live distribution or the like based on coded 

25 data transmitted by the user using the camera exll3. Either the 
camera exll3 or a server and the like capable of data transmission 
processing may code the shot data. Also, moving picture data shot 
by a camera exll6 may be transmitted to the streaming server 
exl03 via the computer exlll. The camera exll6 is a device such 

30 as a digital camera capable of shooting still pictures and moving 
pictures. In this case, either the camera exll6 or the computer 
exlll may code the moving picture data. In this case, an LSI 
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exll7 included in the computer exlll or the camera exll6 
performs coding processing. Note that software for coding and 
decoding may be integrated into a certain type of storage medium 
(such as a CD-ROM, a flexible disk and a hard disk) that is a 
5 recording medium readable by the computer exlll and the like. 
Furthermore, the camera-equipped cellular phone exllS may 
transmit the moving picture data. This moving picture data is data 
coded by an LSI included in the cellular phone exllS. 

In this content supply system exlOO, content (e.g. a music 

10 live video) which has been shot by the user using the camera exll3, 
the camera exll6 or the like is coded in the same manner as the 
above-described embodiment and transmitted to the streaming 
server exl03, and the streaming server exl03 makes stream 
distribution of the content data to clients at their request. The 

15 clients here include the computer exl 1 1, the PDA exl 12, the camera 
exll3, the cellular phone exll4 and so forth capable of decoding 
the above coded data. The content supply system exlOO with the 
above configuration is a system that enables the clients to receive 
and reproduce the coded data and realizes personal broadcasting by 

20 allowing them to receive, decode and reproduce the data in real 
time. 

The picture coding apparatus and picture decoding apparatus 
presented in the above embodiment can be used for coding and 
decoding to be performed in each of the devices making up the 
25 above system. 

An explanation is given of a cellular phone as an example. 
FIG. 15 is a diagram showing the cellular phone exll5 that 
employs the picture coding method and the picture decoding method 
explained in the above embodiment. The cellular phone exllS has 
30 an antenna ex201 for transmitting/receiving radio waves to and 
from the base station exllO, a camera unit ex203 such as a CCD 
camera capable of shooting video and still pictures, a display unit 
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ex202 such as a liquid crystal display for displaying the data 
obtained by decoding video and the like shot by the camera unit 
ex203 and video and the like received by the antenna ex201, a main 
body equipped with a group of operation keys ex204, a voice output 
5 unit ex208 such as a speaker for outputting voices, a voice input unit 
ex205 such as a microphone for inputting voices, a recording 
medium ex207 for storing coded data or decoded data such as data 
of moving pictures or still pictures shot by the camera, data of 
received e-mails and moving picture data or still picture data, and a 

10 slot unit ex206 for enabling the recording medium ex207 to be 
attached to the cellular phone exllS. The recording medium ex207 
is embodied as a flash memory element, a kind of EEPROM 
(Electrically Erasable and Programmable Read Only Memory) that is 
an electrically erasable and rewritable nonvolatile memory, stored in 

15 a plastic case such as an SD card. 

Next, referring to FIG. 16, a description is given of the cellular 
phone exllS. In the cellular phone exll5, a main control unit 
ex311 for centrally controlling the display unit ex202 and each unit 
of the main body having the operation keys ex204 is configured in a 

20 manner in which a power supply circuit unit ex310, an operation 
input control unit ex304, a picture coding unit ex312, a camera 
interface unit ex303, an LCD (Liquid Crystal Display) control unit 
ex302, a picture decoding unit ex309, a multiplexing/demultiplexing 
unit ex308, a recording/reproducing unit ex307, a modem circuit 

25 unit ex306, and a voice processing unit ex305 are interconnected 
via a synchronous bus ex313. 

When a call-end key or a power key is turned on by a user 
operation, the power supply circuit unit ex310 supplies each unit 
with power from a battery pack, and activates the camera-equipped 

30 digital cellular phone exll5 to make it into a ready state. 

In the cellular phone exll5, the voice processing unit ex305 
converts a voice signal received by the voice input unit ex205 in 



-30- 



conversation mode into digital voice data under the control of the 
main control unit ex311 comprised of a CPU, a ROM, a RAM and 
others, the modem circuit unit ex306 performs spread spectrum 
processing on it, and a transmit/receive circuit unit ex301 performs 
5 digital-to-analog conversion processing and frequency 
transformation processing on the data, so as to transmit the 
resultant via the antenna ex201. Also, in the cellular phone exllS, 
data received by the antenna ex201 in conversation mode is 
amplified and performed of frequency transformation processing 
10 and analog-to-digital conversion processing, the modem circuit unit 
ex306 performs inverse spread spectrum processing on the 

i 

resultant, and the voice processing unit ex305 converts it into 
analog voice data, so as to output it via the voice output unit ex208. 
Furthermore, when sending an e-mail in data communication 

15 mode, text data of the e-mail inputted by operating the operation 
keys ex204 on the main body is sent out to the main control unit 
ex311 via the operation input control unit ex304. In the main , 
control unit ex311, after the modem circuit unit ex306 performs ! 
spread' spectrum processing on the text data and the 

20 transmit/receive circuit unit ex301 performs digital-to-analog 
conversion processing and frequency transformation processing on 
it, the resultant is transmitted to the base station exllO via the 
antenna ex201. 

When picture data is transmitted in data communication 

25 mode, the picture data shot by the camera unit ex203 is supplied to 
the picture coding unit ex312 via the camera interface unit ex303. 
When picture data is not to be transmitted, it is also possible to 
display such picture data shot by the camera unit ex203 directly on 
the display unit ex202 via the camera interface unit ex303 and the 

30 LCD control unit ex302. 

The picture coding unit ex312, which includes the picture 
coding apparatus according to the present invention, performs 
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compression coding on the picture data supplied from the camera 
unit ex203 using the coding method employed by the picture coding 
apparatus presented in the above embodiment, so as to convert it 
into coded picture data, and sends it out to the 
5 multiplexing/demultiplexing unit ex308. At this time, the cellular 
phone exll5 sends voices received by the voice input unit ex205 
while the shooting by the camera unit ex203 is taking place, to the 
multiplexing/demultiplexing unit ex308 as digital voice data via the 
voice processing unit ex305. 

10 The multiplexing/demultiplexing unit ex308 multiplexes the 

coded picture data supplied from the picture coding unit ex312 and 
the voice data supplied from the voice processing unit ex305 using 
a predetermined method, the modem circuit unit ex306 performs 
spread spectrum processing on the resulting multiplexed data, and 

15 the transmit/receive circuit unit ex301 performs digital-to-analog 
conversion processing and frequency transformation processing on 
the resultant, so as to transmit the processed data via the antenna 
ex201. 

When receiving, in data communication mode, moving picture 
20 file data which is linked to a Web page or the like, the modem circuit 
unit ex306 performs inverse spread spectrum processing on the. 
received signal received from the base station exllO via the 
antenna ex201, and sends out the resulting multiplexed data to the 
multiplexing/demultiplexing unit ex308. 
25 In order to decode the multiplexed data received via the 

antenna ex201, the multiplexing/demultiplexing unit ex308 
separates the multiplexed data into a coded bitstream of picture 
data and a coded bitstream of voice data, and supplies such coded 
picture data to the picture decoding unit ex309 and such voice data 
30 to the voice processing unit ex305 via the synchronous bus ex313. 

Next, the picture decoding unit ex309, which includes the 
picture decoding apparatus according to the present invention, 
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decodes the coded bitstream of the picture data using the decoding 
method paired with the coding method shown in the 
above-mentioned embodiment so as to generate moving picture 
data for reproduction, and supplies such data to the display unit 
5 ex202 via the LCD control unit ex302. Accordingly, moving picture 
data included in the moving picture file linked to a Web page, for 
instance, is displayed. At the same time, the voice processing unit 
ex305 converts the voice data into an analog voice signal, and then 
supplies this to the voice output unit ex208. Accordingly, voice 

10 data included in the moving picture file linked to a Web page, for 
instance, is reproduced. 

Note that the aforementioned system is not an exclusive 
example and therefore that at least either the picture coding 
apparatus or the picture decoding apparatus of the above 

15 embodiment can be incorporated into a digital broadcasting system 
as shown in FIG. 17, against the backdrop that satellite/terrestrial 
digital broadcasting has been a recent topic of conversation. To be 
more specific, at a broadcasting station ex409, a coded bitstream of 
video information is transmitted, by radio waves, to a satellite 

20 ex410 for communications or broadcasting. Upon receipt of it, the 
broadcast satellite ex410 transmits radio waves for broadcasting, an 
antenna ex406 of a house equipped with satellite broadcasting 
reception facilities receives such radio waves, and an apparatus 
such as a television (receiver) ex401 and a set top box (STP) ex407 

25 decodes the coded bitstream and reproduces the decoded data. 
The picture decoding apparatus as shown in the above-mentioned 
embodiment can be implemented in the reproduction apparatus 
ex403 for reading and decoding the coded bitstream recorded on a 
storage medium ex402 that is a recording medium such as a CD and 

30 a DVD. In this case, a reproduced video signal is displayed on a 
monitor ex404. It is also conceivable that the picture decoding 
apparatus is implemented in the set top box ex407 connected to a 
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cable ex405 for cable television or the antenna ex406 for 
satellite/terrestrial broadcasting so as to reproduce it on a television 
monitor ex408. In this case, the picture decoding apparatus may 
be incorporated into the television, not in the set top box. Or, a car 
5 ex412 with an antenna ex411 can receive a signal from the satellite 
ex410, the base station exl07 or the like, so as to reproduce a 
moving picture on a display device such as a car navigation system 
ex413 mounted on the car ex412. 

Furthermore, it is also possible to code a picture signal by the 

10 picture coding apparatus presented in the above embodiment and to 
record the resultant in a recording medium. Examples include a 
DVD recorder for recording a picture signal on a DVD disc ex421 and 
a recorder ex420 such as a disc recorder for recording a picture 
signal on a hard disk. Moreover, a picture signal can also be 

15 recorded in an SD card ex422. If the recorder ex420 is equipped 
with the picture decoding apparatus presented in the above 
embodiment, it is possible to reproduce a picture signal recorded on 
the DVD disc ex421 or in the SD card ex422, and display it on the 
monitor ex408. 

20 As the configuration of the car navigation system ex413, the 

configuration without the camera unit ex203 and the camera 
interface unit ex303, out of the configuration shown in FIG. 16, is 
conceivable. The same is applicable to the computer exlll, the 
television (receiver) ex401 and the like. 

25 Concerning the terminals such as the cellular phone exll4, a 

transmitting/receiving terminal having both an encoder and a 
decoder, as well as a transmitting terminal only with an encoder, and 
a receiving terminal only with a decoder are possible as forms of 
implementation. 

30 Note that each function block in the block diagram shown in 

FIGs. 3, 4, 10, and 11 can be realized as an LSI that is an integrated 
circuit apparatus. Such LSI may be incorporated in one or plural 
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chip form (e.g. function blocks other than a memory may be 
incorporated into a single chip). Here, LSI is taken as an example, 
but, it can be called "IC", "system LSI", "super LSI" and "ultra LSI" 
depending on the integration degree. 
5 The method for incorporation into an integrated circuit is not 

limited to the LSI, and it may be realized with a private line or a 
general processor. After manufacturing of LSI, a Field 
Programmable Gate Array (FPGA) that is programmable or a 
reconfigurable processor that can reconfigure the connection and 

10 settings for the circuit cell in the LSI may be utilized. 

Furthermore, along with the arrival of technique for 
incorporation into an integrated circuit that replaces the LSI owing 
to a progress in semiconductor technology or another technique that 
has deviated from it, integration of the function blocks may be 

15 carried out using the newly-arrived technology. Bio-technology 
may be cited as one of the examples. 

Among the function blocks, only a unit for storing data may be 
constructed separately without being incorporated in a chip form, as 
the storage medium 115 described in the present embodiment. 

20 Note that the main part in the function blocks shown in FIGs. 

3, 4, 10, and 11 or in the flowcharts shown in FIGs. 7 and to 12 can 
be realized by a processor or a program. 

As stated above, it is possible to employ the picture coding 
method and the picture decoding method presented in the above 

25 embodiment into any one of the above-described devices and 
systems. Accordingly, it becomes possible to achieve an effect 
described in the aforementioned embodiment. 



(Second Embodiment) 
30 The following describes the second embodiment of the 

present invention. 

The present invention is directed to enabling quantization of 
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matrices to be carried out and represented in a video coding system 
that contains intra-prediction and more particularly creating a series 
of cyclic difference values suitable for signed Exp-Golomb codes. 

The prior art is aware of quantization matrix for video coding 
5 and the applicability of Exp-Golomb codes. Video code has also 
been represented in hierarchical syntactic structure with associated 
semantics and has used both lossy and non-lossy or Variable Length 
Coding in video compression. Transform domain quantization has 
also been known and used. 
10 The present invention describes a method and program for 

representing, carrying and enabling video compression by using 
quantization weighting matrices and addresses the following 
features: 

1. Enable quantization matrices to be carried and 
15 represented in a video coding system that contains intra-prediction. 

2. Define and carry quantization weighting matrices that can 
be implemented by integer based and division free computation. 

3. Enable quantization matrix to be encoded in an efficient 
Cyclic Differential code and Exp-Golomb code. 

20 4. Support quantization matrix representation that allows 

redundant weighting values to be derived at decoder side rather 
than being carried over communication transport data. 

0. Introduction of differential coding for quantization matrix 
25 To code quantization matrix entries, the matrix entries are 

first mapped to a one dimensional series. The mapping uses zigzag 
pattern as described in MPEG-4 Part 10 (similarly defined in MPEG-2 
and MPEG-4 Part 2). An example of the zigzag pattern is given in 
FIG. 5A or 5B. The converted one dimensional series has 64 
30 elements for 8x8 quantization matrix and 16 elements for 4x4 
quantization matrix. For a quantization matrix defined, the 
elements are positive integer based with the range from 1 to 255, 
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inclusive. 

In current differential coding, we generate a differential code 
by subtracting the 1-d series element by the value of the previous 
element, with the exception that the first element in the series is 
5 subtracted by 8 (because the value for the first element is a weight 
for DC and is usually around 8). Then the differential codes are 
further coded using Exp-Golomb code, as described in MPEG-4 Part 
10. (ISO/IEC JTC1/SC29WG1 1, 14496-10) 

The current differential coding method of a quantization 
10 matrix has the following two problems: 

1. The difference between the current and the previous values 
of a Q-matrix can have a value anyway between [-254, +254]. The 
spread of this range makes variable length coding using Exp-Golomb 
inefficient. 

15 2. When a M truncating M approach is used to limit the number 

of matrix values to be coded, the special value to signal the end of 
the series of the data has to be reasonably sized. Otherwise it may 
make truncating approach inappropriate. 

The solution here creates a series of cyclic difference values, 

20 so that the range of the differences are all within the range of [-128, 
+ 128], therefore making it more suitable for signed Exp-Golomb 
codes. This also mitigates the problem in signaling the end of the 
code series when the "truncation" approach is used. 

Further toward an appropriate mechanism to signal the end of 

25 code series using the "truncation" approach, we refine here, so that 
the number of bits to be transmitted for signaling the end is equal or 
less than 9-bits. This can be achieved by using multiple signals to 
stop the series. This should be usable for code series of 64 codes 
long and maybe even usable for 16 also. 

30 

1. Inverse process of cyclic differential coding 

Input of the process is D[i], the differential code. 
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Output of the process is W[i], the sequence of Quantization 
Matrix values sampled along the zigzag path. 

The following operation is performed on the input W[i]. 

Define: M=16, for 4x4 block; M = 64, for 8x8 block 

Define M = 16, for 4x4 matrix or M = 64, for 8x8 matrix 

W[0] = (D[0] + 8 + 256)%256 
k = 0; 

for(i=l; (i<M)&&(W[i-l] + D[i] +256) % 256>0;i + + ) 
{ 

W[i] = (W[i-1] + D[i] +256)%256 

} 

for(j =I;j<M;j++){ 
W[j] = W[i-1] 

} 

2. Differential code generation process 

Input of the process is W[i], the sequence of Quantization 
Matrix values sampled along the zigzag path. 

Output of the process is D[i], the differential code generated 
from W[i]. 

The following operation is performed on the input W[i]. 

Define M = 16, for 4x4 matrix or M = 64, for 8x8 matrix 

D[0] =W[0]-8 
k=M; 

for(i=l; i<M; i + + ) 
{ 

D[i] = W[i] -W[i-1] 
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if( D[i]>128) 

D[i] = D[i]-256 
else if (D[i]<-128) 

D[i] = D[i] +256 
else if(D[i] = = 0) && (k== M) 

k=i 

else if(D[i]! = 0) 
k = M 

} 

// This is to insert the ending signal D[k] corresponding to 
W[k] = 0 
if(k» = M){ 

if (W[k-l]>128) 

D[k] = 256-W[k-l] 
else 

D[k] = -W[k-1] 

} 

// This is only for optimization 
if(Length_of-ExpGlomb_code (D[k]) > M-k-i ) { 
for(i = k; i<M; i + + ) 
D[k] = 0 

> 

This will generate a series of differential codes, D[i], where 
the absolute values are always equal to or less than 128, including 
the value indicating the end of the differential code series. 
Therefore these codes are more suitable for signed Exp-Golomb 
coding. 

3. Mechanism to signal the end of a differential code series 

Instead of using a signal value of W or D as a sign of the end 
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of the differential code series, we can use more than one code and 
the encoder can choose which one to send. 

The following are allowed by encoder to signal the end of a 
code series, which the decoder also must be able to recognize. 

Suppose the end of the series to be coded is at k, (i.e. W[k] = 
W[k-1]) for W[i], i = 0,...,M, and M is 16 for 4x4 transform block and 
64 for 8x8 block. 

- SetW[k] = 0. This corresponds to a difference code D[k] 
and the absolute value of D[k] should be less or equal to 128. 

- Set D[k] to be any values above 255 or below -255, which 
will have a signed Exp-Golomb code with the first 9 bit equal 
to zero, OOOOOOOOOlxxxxxxxxx. The encoder only needs to 
send the first 10 bits (can we only send 9 bit instead without 
sending "1"). After decoding, the decoder will get a value 
larger than 255 for signed Golomb code. If decoder can 
handle illegal Golomb code, then sending only 9 bit of 0s will 
be sufficient. 

Corresponding to this, the decoder procedure can be written 
as the follows. (In the following pseudo-code, instead of checking 
D[k] is above +255 or below -255, simply checking if D[i] is in the 
correct range, if not, then assume it is the end of the code sequence. 
Since the checking maybe needed anyway, it saves two conditional 
operations. 

Define: M = 16, for 4x4 block; M = 64, for 8x8 block 
Define M = 16, for 4x4 matrix or M = 64, for 8x8 matrix 
D[0] = (W[0] + 8 + 256)%256 
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k = 0; 

for (1=1; (i<M) &&(Golombcode Legal) && (-129< D[i] < 129) 
&& (W[i-1] +256) % 256>0; i + + ) 

W[i] = (W[i-1] + D[i] +256)%256 

5 { 

for(j = i;j<M;j + + ){ 
W[j] = W[i-1] 

> 

4. Further issues 

Fixed length code combined with Exp-Golomb codes are 
sometimes used to improve efficiency. 

- Generating Cyclic difference 

- 2 bit (or 4bit) LSB using fixed length code 

- 6bit (or 4bit) MSB using Exp-Golomb codes 
This should further reduce total number of bits. 

Note that the above-described "1. Inverse process of cyclic 
differential coding" can be described as derivation process for 
20 quantization weighting matrices as below. 

Input to this process is a sequence of Exp-Golomb coded 
quantization weighting matrix values. 

Output of this process is two dimensional array of 
quantization weighting matrix values. 
25 - The decoding process is invoked to decode the sequence of 

Exp-Golomb coded quantization weighting values. 

- The decoded values, d(i), are used to derive the 
quantization weighting matrix entries W(i). The following 
process is used in deriving the quantization weighting matrix 

30 entries. In the following equation, M = 64 for 8X8 

quantization matrix and M = 16 for 4X4 quantization weighting 
matrix. 
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- W(0) = d(0) + 8 

- If ((d(i)<128) && (d(i)>-128) && ((W(i-l) + 
d(i) + 256)%256!=0)) 

W(i) = (W(i-l) + d(i) + 256)%256, for 0<i<M 
5 - Otherwise 

W(i) = W(k-l), for i = k, ...<M, where D(k)>128 or 
D(k)<-128, or W(k) = 0, and 0<i, k<M 
[W[I] is not allowed to be zero] 
- Quantization weighting matrix W(i, j) is derived from W(i) 
10 using the inverse block scanning process. The 4X4 

inverse scanning order is shown in FIG. 5A. The 8X8 
inverse scanning process is done as shown in FIG. 5B. 

Note that the above-described "1. Inverse process of cyclic 
15 differential coding" can also be described as below. In the following, 
"ScalingList" denotes quantization matrix WM, "lastScale" denotes 
W[i-1], and "delta_scaling" denotes D[i]: 



scaling_list( scalingList, sizeOfScalingList, useDefaultScalingMatrixFlag ) { 
20 lastScale = 8 

nextScale = 8 

for( j = 01 j < sizeOfScalingList; j++ ) { 

if( nextScale != 0 ) { 
delta_scale 

25 nextScale = ( lastScale + delta_scale + 256 ) % 256 

useDefaultScalingMatrixFlag = ( j = = 0 && nextScale = = 0 ) 

} 

scalingListt j ] = ( nextScale = = 0 ) ? lastScale • nextScale 
lastScale = scalingList! j ] 
30 } 
} 
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From the invention thus described, it will be obvious that the 
embodiments of the invention may be varied in many ways. Such 
variations are not to be regarded as a departure from the spirit and 
scope of the invention, and all such modifications as would be 
5 obvious to one skilled in the art are intended for inclusion within the 
scope of the following claims. 

Industrial Applicability 

The present invention is suited for use as a coding apparatus 
10 for coding a picture and a decoding apparatus for decoding a picture, 
as well as a web server for distributing a moving picture, a network 
terminal for receiving such moving picture, a digital camera capable 
of recording and reproducing a moving picture, a camera-equipped 
cellular phone, a DVD recorder/player, a PDA, a personal computer, 
15 and the like. 
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